Reinforcement learning and human behavior.
نویسندگان
چکیده
The dominant computational approach to model operant learning and its underlying neural activity is model-free reinforcement learning (RL). However, there is accumulating behavioral and neuronal-related evidence that human (and animal) operant learning is far more multifaceted. Theoretical advances in RL, such as hierarchical and model-based RL extend the explanatory power of RL to account for some of these findings. Nevertheless, some other aspects of human behavior remain inexplicable even in the simplest tasks. Here we review developments and remaining challenges in relating RL models to human operant learning. In particular, we emphasize that learning a model of the world is an essential step before or in parallel to learning the policy in RL and discuss alternative models that directly learn a policy without an explicit world model in terms of state-action pairs.
منابع مشابه
The Effect of Electronical Media on the Reinforcement of Social Behavior of Youth from the Computer Course Professors and Students Viewpoints of Sari Islamic Azad University
The goal of research was the effect of electronical learning media on the reinforcement of youth social behavior from the point of view of computer course professors and students of Islamic Azad University of Sari. The statistical population was included of all computer students and professors of I.A.U of Sari. The statistical sample was identified by using of the sample content identification ...
متن کاملMulticast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach
Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...
متن کاملReinforcement Learning Based PID Control of Wind Energy Conversion Systems
In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...
متن کاملDifferential effects of reward and punishment in decision making under uncertainty: a computational study
Computational models of learning have proved largely successful in characterizing potential mechanisms which allow humans to make decisions in uncertain and volatile contexts. We report here findings that extend existing knowledge and show that a modified reinforcement learning model, which has separate parameters according to whether the previous trial gave a reward or a punishment, can provid...
متن کاملA method to embed human knowledge to reinforcement learning method∗
Reinforcement learning is a framework to learn from delayed reward and punishment for a model of both animal and robot learning. To make it more practical to design an intelligent machine, it would be better to be able to combine with human knowledge. This paper presents a method to introduce such a knowledge into a reinforcement learning system by embedding it as an intrinsic behavior, just li...
متن کاملScaling up Reinforcement Learning with Human Knowledge as an Intrinsic Behavior
Abstract. Reinforcement learning is a framework to learn from delayed reward/punishment for a model of both animal and robot learning. To make it more practical to design an intelligent machine, it would be better to be able to combine with human knowledge. This paper presents a method to introduce such a knowledge into a reinformcement learning system by embedding it as an intrinsic behavior, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Current opinion in neurobiology
دوره 25 شماره
صفحات -
تاریخ انتشار 2014